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Amendments to the Specification 

Please replace paragraphs [0019]-[0020] with the following replacement 
paragraph. 

f 

[0019] A first step 1 in the method illustrated in Figure 1, identifies one or more 
alignment domains in each sequence corresponding to a structure returned from a 
database query. An alignment domain refers to: 1) to a sequence domain that is 
evolutionarily conserved across related sequences, or 2) the sequence domain 
corresponding to a structural fold that is conserved across related structures. One 
method for annotating one more alignment domains in a sequence uses RPS-BLAST in 
combination with a database of annotated sequence domains or sequence profiles 
such as the Pfam Database, the SMART Database, or the COG database to search a 
sequence for domains annotated in these respective databases. Links to each of 
these databases and a link for downloading RPS-BLAST is available at on the internet 
http://www.ncb i .n l m.n i h.gov/Structuro/cdd/cdd.shtm l . Cut-off e-values for identifying a 
sequence domain may vary depending upon the required confidence of a domain 
identification, but an exemplary, suitable, cut-off e-value ranges from 1CT 4 to 1CT 2 , with 
10" 3 preferred. 

[0020] Alignment domains may also be based upon structure-structure alignment 
domains. If a sequence corresponding to a structure returned from a database query is 
in the Protein Data Bank, and annotated in the SCOP database, 
http://scop.borko l oy.odu/, SCOP structural annotations may be used to annotate it. In 
addition to using domain databases, one or more sequence domains may be identified 
in a sequence (referred to as the query sequence) by a method comprising the steps of: 
1 ) determining a plurality of related template sequences to the query sequence using a 
sequence alignment tool such as the various BLAST, Smith-Waterman or FASTA 
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algorithms and a large protein sequence database such as the NCBI Protein Sequence 

Database, http://www.ncbi.nlm.n i h.gov/ ; 2) identifying putative conserved domains in 

the template sequences based upon their alignments with the query sequence; 3) 

performing a multiple sequence alignment on the related template sequences using a 

multiple sequence alignment tool such as ClustalW. 4ittp://www.ob i . ac.uk/clusta l w/ ; 4) 

identifying one or more conserved domains from the multiple sequence alignment; 5) 

determining one or more domain profiles or Hidden Markov Models ("HMMs") of the 

domains identified in step 4); and 6) identifying an alignment domain with a profile or 

HMM determined in step 5). A domain profile may be determined from a domain 

identified in a multiple sequence alignment using PSI-BLAST, 

http://www'1 .ncbi.nlm.n i h.gov/Education/BLAST i nfo/ps i 1 .htm l . A Hidden Markov Model 
may be built from a multiple sequence alignment using HMMER , ava il ab l e for down l oad 
at http://hmmor.wust l^y/ . Methods for determining putative conserved alignment 
domains based upon alignment data are within the capacity of one ordinarily skilled in 
the art. Sonnhammer, E.L.L., and Kahn, D., Modular Arrangement of Proteins as 
Inferred from Analysis of Homology, PROTEINS: Structure, Function and Genetics, 
3:482-492 (1994) discusses the methods for identifying one or more putative conserved 
alignment domains from sequence alignment data. Sonnhammer, E.L.L., Eddy, S.R., 
and Durbin, R., Pfam: A Comprehensive Database of Protein Domain Familes Based 
on Seed Alignments, PROTEINS: Structure, Function and Genetics, 28:405-420 (1997) 
details how the alignment domains in the PFAM database are determined. 

Please replace paragraph [0029] with the following replacement paragraph. 
[0029] A sixth step 9, receives one or more sequence selections made by the user 
using the means presented to the user for selecting a sequence thereby selecting 13 
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their corresponding structures for subsequent processing. Protein structure processing 

refers to the processing of the structure coordinate files corresponding to the selected 

proteins. Exemplary processing includes but is not limited to, visualization of the 

protein structure or further proteomic analysis such as, fold identification, the 

identification of functional sites on the protein, virtual ligand screening or small molecule 

docking. As one ordinarily skilled in the art will appreciate, the methods according to 

the invention are agnostic as to the nature of the post selection protein structure 

processing. If a structure has been selected for visualization, its coordinate file may be 

processed with protein structure viewing software in order to display a three 

dimensional model of the protein to a user. Exemplary software for displaying protein 

structures includes but is not limited to, Rasmol , av a i l ab lo for down l oad at 

http://www.rasmol.org/ , Cn3D ava i lab le for down l oad at 

http://www.ncb i .n l m. nih.qov/Structur e /CN3D/cn3d.s ^tmt . Molscript , avai l abl e for 

download at, http://www.avatar.so/molscr i pt/ . MolMol ava i lab le for down l oad at 

http://www.mol.b i ol.othz.ch/wuthrich/software/mo l mo t/ , and the Insight II software suite 

available from Accelrys, Inc., (San Diego, California). For those structure viewers that 

recognize PDB structure files, a protein structure file may be formatted accordingly. 

Soo http://www.rcsb.Org/pdb/docs/format/pdbguid o 2.2/quido2.2 framo.htm l . For those 

current or future viewers that do not recognize PDB structure files, a script may be 

written, using either the native scripting features in a viewer, or in an external scripting 

language, to format a structure file in a format that is recognized by a particular viewer. 

Please replace paragraph [0050] with the following replacement paragraph. 
[0050] Programming for displaying protein structures based upon their structural 
coordinates 130, as used herein, refers to machine code, that when executed by the 
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processor, displays protein structures to the user via the output device, 123, based 

upon their structural coordinates. Exemplary software for displaying protein structures 

includes but is not limited to, Rasmol, avai l ab l o for download at http://www.rasmo l .org/, 

Cn3D ava i lab l o for down l oad at 

http://www.ncbi. n l m.n i h.gov/Structure/CN3D/cn3d.s ^tmt t Molscript , availab le for 
down l oad at, http://www.avatar.SG/mo l scr i pt/ . MolMol ava il ab l o for download at 
http://www.mo l .b i o l . e thz.ch/wuthr i ch/softwar e /molni Gt/ , and the Insight II software suite 
available from Accelrys, Inc., (San Diego, Ca). 
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